Clustering Hungarian Verbs on the Basis of Complementation Patterns
Abstract
Our paper reports an attempt to apply an unsupervised clustering algorithm to a Hungarian treebank in order to obtain semantic verb classes. Starting from the hypothesis that semantic metapredicates underlie verbs’ syntactic realization, we investigate how one can obtain semantically motivated verb classes by automatic means. The 150 most frequent Hungarian verbs were clustered on the basis of their complementation patterns, yielding a set of basic classes and hints about the features that determine verbal subcategorization. The resulting classes serve as a basis for the subsequent analysis of their alternation behavior.
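The abstract does not name the clustering algorithm or the exact feature encoding, so the following is only a minimal sketch of the general idea: verbs are represented by counts of their complementation frames and grouped by similarity. The frame labels, the counts, the cosine measure, and the average-link agglomerative procedure are all assumptions made for illustration, not the authors' method.

```python
from itertools import combinations
from math import sqrt

# Hypothetical toy data: how often each verb occurs with each complement frame.
# Frame labels (case combinations) and counts are invented for illustration.
FRAME_COUNTS = {
    "ad":   {"NOM+ACC+DAT": 8, "NOM+ACC": 2},
    "küld": {"NOM+ACC+DAT": 7, "NOM+ACC": 3},
    "megy": {"NOM+ILL": 9, "NOM": 1},
    "jön":  {"NOM+ILL": 6, "NOM": 4},
}

def cosine(a, b):
    """Cosine similarity between two sparse frame-count vectors."""
    dot = sum(a[k] * b.get(k, 0) for k in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def average_link(c1, c2, vectors):
    """Mean pairwise similarity between the members of two clusters."""
    pairs = [(x, y) for x in c1 for y in c2]
    return sum(cosine(vectors[x], vectors[y]) for x, y in pairs) / len(pairs)

def cluster(vectors, n_clusters):
    """Merge the two most similar clusters until n_clusters remain."""
    clusters = [frozenset([verb]) for verb in vectors]
    while len(clusters) > n_clusters:
        a, b = max(combinations(clusters, 2),
                   key=lambda pair: average_link(pair[0], pair[1], vectors))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters

classes = cluster(FRAME_COUNTS, 2)
for members in classes:
    print(sorted(members))
```

With this toy input the two transfer verbs and the two motion verbs end up in separate classes, which is the kind of semantically interpretable grouping the abstract aims for; real treebank data would need frequency thresholds and a principled choice of the number of clusters.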
Related resources
First Attempt to Automatically Generate Hungarian Semantic Verb Classes
Aiming to create verb paraphrases to lay the foundation of sentence paraphrases I automatically created Hungarian semantic verb classes with k-means algorithm. The vector representation of verbs was special: dimensions were cases and values were sets of lemmas that can fill the verb frame position defined by the case. I clustered 900 frequent verbs, from which 243 got into 71 smaller clusters, ...
Detecting Optional Arguments of Verbs
We propose a novel method for detecting optional arguments of Hungarian verbs using only positive data. We introduce a custom variant of collexeme analysis that explicitly models the noise in verb frames. Our method is, for the most part, unsupervised: we use the spectral clustering algorithm described in Brew and Schulte im Walde (2002) to build a noise model from a short, manually verified se...
Semantic Clustering of Adjectives and Verbs Based on Syntactic Patterns
In this paper we show that some of the syntactic patterns in an NLP lexicon can be used to identify semantically “similar” adjectives and verbs. We define semantic similarity on the basis of parameters used in the literature to classify adjectives and verbs semantically. The semantic clusters obtained from the syntactic encodings in the lexicon are evaluated by comparing them with semantic grou...
Verbal complementation: A pedagogical challenge
Errors of verbal complementation are among the most frequent and intractable types of grammatical error produced by ESL learners of all levels. Formal rules help only to a very slight degree, and most of the time one operates on intuition. One avoids constructions such as *Mary avoids to make mistakes. simply on the basis of feel—it sounds and looks odd. Many learners seem to operate on the ‘ec...
The role of Persian causative markers in the acquisition of English causative verbs
This project investigates the relationship between lexical semantics and causative morphology in the acquisition of causative/inchoative-related verbs in English as a foreign language by Iranian speakers. Results of translation and picture judgment task show although L2 learners have largely acquired the correct lexico-syntactic classification of verbs in English, they were constrained by ...
Publication date: 2007